CDS

Accession Number TCMCG022C35857
gbkey CDS
Protein Id XP_039165210.1
Location join(9023032..9023162,9035375..9035467,9035955..9036076,9036969..9037084,9038613..9038686,9055143..9055407,9055539..9055670,9056576..9056711,9056878..9056915)
Gene LOC120291617
GeneID 120291617
Organism Eucalyptus grandis

Protein

Length 368aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA698663
db_source XM_039309276.1
Definition pro-cathepsin H-like [Eucalyptus grandis]

EGGNOG-MAPPER Annotation

COG_category O
Description Belongs to the peptidase C1 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
KEGG_ko ko:K01366        [VIEW IN KEGG]
EC 3.4.22.16        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04142        [VIEW IN KEGG]
ko04210        [VIEW IN KEGG]
map04142        [VIEW IN KEGG]
map04210        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGTGCTGGTGATGAGGGATGATGACGAAGATGTGGGGTTGCTGGCGGTGACGGTGCAGAGGAGAAATGGAGAATACGTGAAGGGGAAAAGGGATGAGGCTGCTGGAGGCGTGAGGAAGAAAGAAGAAGGCAGCTGGCTTCAGGGCGCCGACCTCGAGTCCTCCATCCTCCAAACCGTTGGCCACGGCCGTCCCGCCCTCTCCTTCGTAGACTTTGCCAGCAGGTACGAGAAGAGGTACGAGACAGCGCAGGAGATCAAGTTGAGGTTCGATAATTACAGGGAGAATCTCAAGCTCATTCGATCCACCAACCAGAAGGGCTTGCCTTACACTCTCGCTGTTAATCAGTATGCTGACTGGAGCTGGGAGGAGTTCAAGACGCACAGACTGGGAGCTTCTCAAGACTGCTCTGCCACCACCAAGGGCAGCCACAAGCTCACCGACGCTGTTCTTCCCAAAACGAAAGACTGGAGAAAAGAGGGCATTGTAAGCCCAGTTAAAAATCAAGGCGGCTGTGGATCTTGCTGGAGTTTCAGCGCAACTGGAGCTCTCGAGGCTGCTTATCACCAAGCACACGGGAAAGGAATCTCTCTGTCTGAGCAGCAGCTCGTGGACTGCGCTACGGCTTTCAACAACTTTGGATGCGGTGGCGGGTTGCCGTCGCAAGCCTTCGAGTACATCAAGTACAACGGTGGCCTTGAGACCGAGGAAGCTTATCCTTACACTGCACGAAATGGTACCTGCAAATTCTCGGCTGGCAAGGTCGCTGTCAAAGTTGTCGACTCTGTCAACATCTCTATGGGTGCTGAGGATGAACTTAAGCATGCAGTTGGCCTGGTCCGGCCAGTCAGTGTGGCATTCCAGGTCACGGATGGCTTCCAGCTCTACGAGTCGGGTGTGTTCACCAGCGATGCATGTGGTAGCACTTCCATGGATGTGAACCATGCTGTTGTTGCTATCGGTTATGGAGTTGAGAACGGTGTTCCATACTGGCTTATCAAGAATTCCTGGGGAGAGAGCTGGGGCGACAAAGGATACTTCAAGATGGAGATGGGGAAGAACATGTGTGGTGTCGCTACTTGTGCATCATACCCTGTTGTGGCCTAG
Protein:  
MVLVMRDDDEDVGLLAVTVQRRNGEYVKGKRDEAAGGVRKKEEGSWLQGADLESSILQTVGHGRPALSFVDFASRYEKRYETAQEIKLRFDNYRENLKLIRSTNQKGLPYTLAVNQYADWSWEEFKTHRLGASQDCSATTKGSHKLTDAVLPKTKDWRKEGIVSPVKNQGGCGSCWSFSATGALEAAYHQAHGKGISLSEQQLVDCATAFNNFGCGGGLPSQAFEYIKYNGGLETEEAYPYTARNGTCKFSAGKVAVKVVDSVNISMGAEDELKHAVGLVRPVSVAFQVTDGFQLYESGVFTSDACGSTSMDVNHAVVAIGYGVENGVPYWLIKNSWGESWGDKGYFKMEMGKNMCGVATCASYPVVA